HIGH PERFORMANCE CRC 
CALCULATION METHOD AND SYSTEM 
WITH A MATRIX TRANSFORMATION 

STRATEGY 



FIELD OF THE INVENTION 

The present invention relates generally to a CRC 
calculation, and, more particularly, to a high performance CRC 
calculation method and system with a matrix transformation 
strategy. 

BACKGROUND OF THE INVENTION 

Cyclic redundancy code (CRC) has been used for a long 
time to preserve the integrity of digital data in storage and 
transmission systems. More particularly, CRC is an important 
error detection tool used for communications and data processing 
applications. The CRC schemes are often used for checking 
integrity of data because they are easy to implement and they 
detect a large class of errors. CRC is a kind of checksum which is 
transmitted with data between a source node and a target node 
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over a communications medium. The source node calculates the 
CRC for the data to be transferred using a predetermined 
polynomial and then transmits the data along with the CRC to the 
target node where the CRC of the received data is independently 
5 generated using the predetermined generator polynomial and 
compared with the CRC received from the source node to check if 
errors have occurred during the transmission. Treating the data 
or message as a binary polynomial, its CRC corresponding to a 
particular generator polynomial may be generated by raising the 

10 message polynomial to a proper power first and then taking the 
remainder of the message polynomial divided by the generator 
polynomial. For CRC generation, data bits are typically serially 
inputted into a CRC generator in order to produce the appropriate 
CRC code for transmission along with the data. Traditionally, 

15 CRC codes are generated with Linear Feedback Shift Register 
(LFSR) circuits. An LFSR takes the input data and shifts through 
a series of flip-flops on successive clock cycles. Combinations of 
the shift register output and data input are fed back to the 
flip-flops via exclusive-OR gates. An LFSR can be defined in 

20 terms of a generator polynomial which relates the input data and 
the CRC code via a polynomial expression and of which "+" is an 
exclusive-OR operation. The state of the flip-flops upon 
completion of the shifting process is the CRC code. 

25 For example, ATM uses a FCS field derived from CRC 
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error detection codes for error checking. The integrity of the 
transmitted or processed message in an ATM system is ensured by 
the addition at the end of the message of the FCS traveling with 
the message itself so it can be checked on the reception side for 
proper transmission. The FCS code has been standardized for 
data integrity checking as described in the ANSI X3. 139- 1987 
document pages 28 and 29. All the CRC codes constitute a finite 
Galois Field (GF), and the CRC 3 2 codes belong to the GF 
generated by the following generator polynomial of degree 32: 

g(x)=x 32 +x 26 +x 23 +x 22 +x 16 +x 12 +x n ^ 
+ 1. 

This generator polynomial of degree 32 was chosen as a standard 
for error checking in Ethernet and then chosen by the ATM 
standard for AAL5 error checking. In the circuitry for calculating 
the FCS or checking the message, an LFSR carries out a bit by bit 
multiplication in the GF, i.e., modulo the polynomial on which GF 
is generated, and by which each bit of the message is inputted into 
the LFSR in the manner of most significant bit (MSB) first and 
division is performed by feedbacks. At the end of the process, the 
FCS, i.e., the remainder of the division, is within the shift 
registers. 

Hardware implementation for CRC generators in large 



scale digital systems is preferred because it is faster. The 
drawback of hardware implementation of CRCs is that more 
hardware is required with consequent increase in cost, size and 
complexity and a decrease in reliability. Software implemented 
CRCs are known although their use is not widespread because of 
the speed penalty thought to be inevitable. Those skilled in the 
art understand that choosing a polynomial of a larger degree will 
result in greater error detection. However, for the applications of 
current large scale systems, the desired hardware becomes too 
complicated and costly to be implemented and the required 
software needs huge computations. Several improvements were 
made for CRC generators. For example, by using CRC routines to 
generate tables consisting of all possible combinations of the 
chosen polynomial, the checksum generation is reduced to a table 
lookup. These CRC routines are considered to be the fastest 
software implementations available, but they take up a great deal 
of dedicated memory. Early CRC implementations use the 
concept of LFSR in which the polynomial division is processed one 
bit at a time. However, the serial processing for the generation of 
the CRC is relatively slow, and as the technology advanced, 
single-bit CRC generation was not enough to handle high-speed 
data processing and transmission, and parallel CRC algorithms 
were then developed to meet this need. The key reason that 
existing CRC algorithms are limited in their degree of parallelism 
is deeply rooted in the concept of LFSRs. All existing algorithms 



try to solve the same problem, i.e., how to parallelize the bit-by-bit 
operation of LFSRs. As a result, the degree of parallelism never 
goes beyond the perceived size of LFSRs. 

5 Accordingly, it is desired a CRC calculation method and 

system to reduce the processing for generation of CRC codes. 

SUMMARY OF THE INVENTION 

10 

The present invention is directed to a methodology to 
simplify the CRC calculation, by which the process for the CRC 
generation is speeded up and the circuitry for the system is 
simplified. 

15 

In a method and system to simplify the CRC calculation, 
according to the present invention, a linear mapping matrix is 
used for the operation of the LFSR to generate the CRC and the 
maximum value of the non-zero entries in the mapping matrix is 

20 reduced by applying one or more raw operations to the linear 
mapping matrix in advance before the computation of mapping 
the input message to a CRC result for the CRC generation. 
Various transformation matrixes are provided for the reduction of 
the maximum value of the non-zero entries in the mapping matrix. 

25 In addition, the input messages are padded with specific dummies 
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on the transmission side or the CRC outputs on the reception side 
are compared with specific patterns in accordance with their 
length types for the doubleword-wise CRC32 case. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects, features and advantages of the 
present invention will become apparent to those skilled in the art 
10 upon consideration of the following description of the preferred 
embodiments of the present invention taken in conjunction with 
the accompanying drawings, in which: 

Fig. 1 shows two schemes for a generator polynomial 
15 with message shifted into LFSR from MSB and LSB sides, 
respectively; 

Fig. 2 shows two schemes linearly mapped from those of 

Fig. 1; 

20 

Fig. 3 shows the mapping matrix and its inverse for 
CRC32 generations; 

Fig. 4 shows the circuitries for byte-wise CRC32 
25 generations; 
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Fig. 5 shows the mapping matrix and its inverse for 
byte-wise CRC32 generations; 



5 Fig. 6 shows a circuitry corresponding to a CRC 

calculation with a matrix transformation applied to the Scheme 1 
of Fig. 4; 



Fig. 7 shows a CRC generator with a pipeline 
10 architecture by additionally inserting a 32-bit- wise flip-flops at the 
output of the S transform of the circuitry shown in Fig. 6; 

Fig. 8 shows a CRC checker with a pipeline architecture 
corresponding to the circuitry shown in Fig. 7; 

15 

Fig. 9 shows an excellent solution of the matrix U for 
doubleword-wise CRC32; 



Fig. 10 shows an excellent solution of the matrix S 
20 corresponding to the matrix U of Fig. 9; 



Fig. 1 1 shows an excellent solution of the matrix U for 
the transmission byte-wise CRC32 generator; 



Fig. 12 shows an excellent solution of the matrix S 
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corresponding to the matrix U of Fig. 11; 

Fig. 13 shows a circuitry for the transmission byte-wise 
CRC32 generator corresponding to the matrices U and S of Figs. 
11 and 12; 

Fig. 14 shows an excellent solution of the matrix U for 
the reception byte-wise CRC32 checker; 

Fig. 15 shows an excellent solution of the matrix S 
corresponding to the matrix U of Fig. 14; and 

Fig. 16 shows a circuitry for the reception byte-wise 
CRC32 checker corresponding to the matrices U and S of Figs. 14 
and 15. 

DETAILED DESCRIPTION OF THE INVENTION 

Cyclic code in a systematic form 

As is well-known, an (n, k) linear code C is called a cyclic 
code if every cyclic shift of a code vector in C is also a code vector 
in C. To figure out a cyclic code in a systematic form on the 
transmission side, let the message to be encoded is 
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M=(m k .i...mimo) T , (EQ-1) 
and the corresponding message polynomial is 

5 

m(x)=mox k - 1 +mix k - 2 +. . . +mk-2X+m.k-i . (EQ-2) 

After multiplying m(x) by x n k , equation EQ-2 becomes 

1 0 x nk m(x)=mox nl +mix n - 2 +. . . +rrik-2X n - k + 1 +mk-ix nk . (EQ-3) 

Then, x n k m(x) is divided by the generator polynomial g(x), and it 
becomes 

15 x nk m(x)=q(x)g(x)+r(x). (EQ-4) 

By rearranging equation EQ-4 and inversing the sign of the 
remainder to replace the original one, it will be obtained the 
codeword polynomial 

20 

x nk m(x)+r(x)=q(x)g(x). (EQ-5) 

Obviously, this codeword polynomial is divisible by the generator 
polynomial g(x). 

25 
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From the above description, it can be summarized that 
a cyclic encoding in a systematic form includes: 



Step 1. Multiplying the message m(x) by x n - k ; 

Step 2. Deriving the remainder r(x) by dividing x n k m(x) by the 

generator polynomial g(x); and 
Step 3. Combining r(x) with x n " k m(x) to obtain the codeword 

polynomial x n k m(x)+r(x). 

Likewise, in order to check the integrity of the received 
codeword on the reception side, it is verified if the received 
sequence is divisible by the generator polynomial g(x). 

Shortened cyclic codes 

Given an (n, k) cyclic code C, if the set of the code 
vectors for which the I leading high-order information digits are 
identical to zero, then there are 2 k ' such code vectors and they 
form a linear 'subcode of C. If the I zero information digits are 
deleted, it is obtained a set of 2 k -< vectors of length n-l These 
shortened vectors form an (n-l, k-l) linear code, and which code is 
called a shortened cyclic code and is not cyclic. 

implementation of divisor 
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No matter for a cyclic code encoding or decoding, a 
divisor of Galois Field GF(2) is needed. For example, a simple 
Linear Feedback Shift Register (LFSR) is employed to implement 
the divisor. Furthermore, depending on the dividend sequence 
shifted into LFSR either from MSB side or Least Significant Bit 
(LSB) side, there are two schemes for implementation of a divisor, 
i.e., 

Scheme 1: Message is shifted into LFSR from MSB side, which is 
mathematically equivalent to 

m(x)x nk modg(x) (EQ-6) 

Scheme 2: Message is shifted into LFSR from LSB side, which is 
mathematically equivalent to 

m(x)modg(x) (EQ-7) 

For illustration, two circuitries are shown in Fig. 1 for these two 
schemes for the generator polynomial g(x)=x 3 +x 2 +l. 

Linear mapping 

Further, the linear feedback shift registers shown in Fig. 
1 can be regarded as a linear mapping mathematically, as shown 
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in Fig. 2. For the same generator polynomial g(x)=x 3 +x 2 +l, it can 
be derived the G mapping: 

(EQ-8a) 
(EQ-8b) 
(EQ-8c) 

and this linear mapping can be represented in a matrix form as 



>o(2)" 




~g,(2) 




= G 




_*>(<>)_ 







go(2)=gi(2)egi(l), 
go(l)=gi(0), and 
go(0)=gi(2), 



where 



0 


1 


0 


1 


0 


1 


1 


0 


0 



(EQ-10) 



and trivially, the matrix G is invertible and its inverse matrix is 
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G' = 



0 0 1" 

1 0 0 
0 1 1 



(EQ-11) 



Based on the Scheme 1 and 2, there exist recursive 
equations between the output of the D-type flip-flops of the 
5 polynomial generator g(x) and the input of the encoded message, 
respectively, as 

Scheme 1: 

10 R(k)=G(R(k-l)+M(k-l)), and (EQ-12) 

Scheme 2: 

R(k)=GR(k-l)+M(k-l). (EQ-13) 

15 

Further tracing the output of the D-type flip-flops, i.e., the 
remainder of a division, in Scheme 1, it results in 

R(0)=I> (EQ-14a) 

20 R(1)=G(R(0)+M(0))=GI+GM, (EQ-14b) 

R(2)=G(R(1)+M(1)) 

=G2I+G2M(0)+GM(1), (EQ-14c) 
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R(k)=G(R(k-l)+M(k-l)) 

=G k I+G k M(0)+G k - 1 M(l) + ...+GM(k-l) (EQ-14d) 



5 Generation of FCS 

In Standard 802.3, the CRC32 is employed to generate 
FCS and the generator polynomial is 

10 g(x)=x 3 2+x 26 +x 23 +x 22 +x 16 +x 12 +x 11 +x 10 +x 8 +x 7 + 

x 5 +x 4 + x 2 +x+ 1 . (EQ- 1 5) 

Mathematically, the CRC value corresponding to a given frame is 
defined by the following procedures: 

15 

a. ) The first 32 bits of the frame are complemented; 

b. ) The k bits of the frame are then considered to be the 

coefficients of a polynomial m(x) of degree k- 1 ; 

c. ) m(x) is multiplied by x 32 and divided by g(x), producing a 
20 remainder r(x) of degree less than or equal to 3 1 ; 

d. ) The coefficients of r(x) are considered to be a 32-bit sequence; 

and 

e. ) The bit sequence is complemented and the result is the FCS 

f(x). 
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In the procedure a, disclosed are two implementation methods: 

Method 1: complementing the first 32bits of the message directly; 
and 

5 Method 2: initiating the D-type flip-flop with 1 specific value, e.g., 
Oxffffffff for the Scheme 1 and 0x46af6449 for the 
Scheme 2. 

The mapping matrix G and its inverse matrix G 1 are shown in Fig. 
10 3. 

On the reception side, when the whole of frame is 
acquired, the output of the Scheme 1 CRC checker is compared 
with the value of 0xc704dd7b to examine the integrity of the 
15 received frame. The reason is explained herewith. 

Let the transmitted message (except for FCS) is represented in a 
polynomial form 

20 m(x)=mox k ^ +mix k - 2 +. . . +m k -2X+mk-i , (EQ- 1 6) 

and defining a polynomial c(x) of degree 31 with all of its 
coefficients to be 

25 c(x)=lx?* + lx30+ m „+i x 2+ lx +j 9 (EQ-17) 
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then the remainder r(x) generated by the procedure c will be 



r(x)=(m(x)+c(x)x k - 32 )x 32 modg(x) . (EQ- 18) 

After performing procedures d to e, it will generate FCS and the 
transmitted sequences 



f(x)= r(x), and (EQ-19) 
n(x)= m(x)x 32 +f(x) (EQ-20) 

On the reception side, if the integrity of this frame sequence is 
maintained, then the remainder of a division will be 

s(x)=(n(x)+c(x)x k )x 32 modg(x) 

=(m(x)x 32 +f(x)+c(x)x k )x 32 modg(x) 

=(m(x)x 32 +c(x)x k +f(x))x 32 modg(x) (EQ-2 1) 

From equation EQ-18, equation EQ-21 can be further modified to 
be 

s(x)=(q(x)g(x)+r(x)+f(x))x 32 modg(x) 
=(r(x)+f(x))x 32 modg(x) 
=c(x)x 32 modg(x) 

=[0xc704dd7b][x 31 ...x 1 1] (EQ-22) 
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Based on a similar derivation, it can be further obtained, 
if the Scheme 2 is adopted, that the checking pattern will be the 
value of Oxffffffff . 



Parallelized CRC calculation 



So far the encoding message is sequentially inputted to 



the CRC calculation with one bit each time, however, for 
high-speed applications, CRC calculation is desired for the 
capability of multiple message bits inputted, e.g., byte-wise, at a 
time to increase the throughput. Consequently, the principal 
architecture of the previous proposed two schemes is maintained 
and there is only somewhat difference at the mapping matrix. 

Let the input message and the status of the flip-flops be 
represented, respectively, with a vector form as 



M(k)=[m k 0 ... 0]T, and 
R(k)=[r k 31 r k 30 ...ifcop. 



(EQ-24) 



(EQ-23) 



Tracing the R(k) influenced by the values of M(k), 



initially, R(0)=0, 



(EQ-25a) 



then, 



R(1)=G(R(0)+M(0))=GM(0), and 



(EQ-25b) 



17 



10 



R(2)=G(R(1)+M(1))=G 2 M(0)+GM(1). (EQ-25c) 
It can be further verified 

G 2 M(0)=m 0 xthe 1st column of the G 2 matrix, and 

(EQ-26a) 

GM(l)=mi><the 1st column of the G matrix. (EQ-26b) 
Defining a new vector with I (^32) non-zero entries as 

Mz(k)=[m k 11 mic 1 " 2 ...m k ° 0 ...0] T (EQ-27a) 
=[m k *i niki+i ...m k *i+i-i 0 ...0] T (EQ-27b) 

When 1=2 and k=0, it becomes 

M 2 (0)=[m 0 mi ...0 ...0] T , (EQ-28) 

and it is further derived 

20 G 2 M 2 (0)=mo x the 1st column of the G 2 matrix + 

mixthe 2nd column of the G 2 matrix. (EQ-29) 

Examining the property of the G matrix, it can be found 

25 the 1 st column of the G matrix = 
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the 2nd column of the G 2 matrix. 



(EQ-30) 



Hence, it is obtained 

G2M 2 (0)=G2M(0)+GM(1), (EQ-31) 
and it is further included that 

GHVIz(0)=G'M(0)+G^M(l)+...+GM(Z-l), for J^32. (EQ-32) 

When the message is inputted iri byte-wise form at a 
time, the input message and calculated remainder vectors are 
represented as 

M 8 (k) = [m^ m< ... m° k 0 ... 0]f x32 (EQ-33) 
R(k) = [rr rf .-.rJir* (EQ-34) 

If the Scheme 1 is adopted, the recursive equation of the input 
message and calculated remainder is 

R(k+1) = T(R(k)+M 8 (k)), (EQ-35) 

and the circuitry is shown in Fig. 4. 

Likewise, if the Scheme 2 is adopted, the recursive 
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equation will be 



R(k+1) = TR(k)+M 8 (k), (EQ-36) 

5 and its circuitry is also shown in Fig. 4. 

For equation EQ-35 and 36, the mapping matrix T and 
its inverse are shown in Fig. 5, and of which, the number on the 
right-hand side of each row indicates how many nonzero entries 

10 that row has. For example, in the matrix T, Row 1 has 4 nonzero 

entries and those rows with maximum nonzero entries, the value 
of 7, are Row 5,12 and 13. In the T matrix of Fig. 5, for a specific 
row, the number of nonzero entries subtracting one is equivalent 
to how many GF(2) adder (modulo 2) needed. If the maximum 

15 value of the non-zero entries in the T matrix is reduced, the 

operation speed of the CRC circuitry could be promoted further, 
due to the dramatic reduction of computations in the matrix 
multiplication or iteration procedure. 

20 Similarity of matrices 

If there are two matrices T and U with the relationship 
U=STSi, (EQ-37) 

25 
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where S is invertible, then the matrix U is similar to the matrix T. 
Based on the similarity of matrices, there can be obtained two 
properties 



a) Tk=S-iUkS, and 

b) U is invertible if T is invertible. 

Substitution of the properties into equation EQ-12 and 
replacement of G with T will result in 

R(k) = S-iUS(R (k-l)+M(k-l)), (EQ-38a) 
SR(k) = U(SR(k-l)+SM(k-l)). (EQ-38b) 

Let 

R{k) = SR(k), and 
M(k) = SM(k), 

then 

R(k) = U(tf(k-1)+M(k-1)). (EQ-40) 
The circuitry corresponding to equation EQ-40 is shown in Fig. 6. 

Further, without effecting the mathematical equality, a 



(EQ-39a) 
(EQ-39b) 
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32-bit-wise flip-flops is additionally inserted at the output of the S 
transform of the circuitry shown in Fig. 6, in order to form a 
pipeline architecture as shown in Fig. 7. 



5 On the other hand, if the CRC calculation is applied on 

the reception side, an alternative method is proposed to omit the 
practical hardware operation of S" 1 R(i) for the comparison with Pi, 
i.e., to compare the output of the flip-flops thereof with the value 
of SPi, which can be carried out in advance. The resultant 
10 circuitry is shown in Fig. 8. 

For convenience, let W (T) represent the maximum 
value of non-zero entries among all rows of the matrix T. To 
analyze the circuitry of Fig. 7, if both W (U) and W (S) are less than 

15 W(T), an advantage will be acquired that the CRC calculation can 

operate in a higher clocking. To find such matrices U and S, one 
strategy is proposed that the matrix S is produced by the 
multiplication of some basic row operation matrices R(i,j), whose 
function is to add Row i with Row j to thereby produce a new Row 

20 j. For a 3x3 matrix T, for example, to generate another matrix U 
whose Row 0 and 1 are the same as those of the matrix T and 
whose Row 2 is the summation of Row 0 and 2 of the matrix T, the 
matrix U can be acquired by multiplying the matrix T in the left 
side by a row operation matrix 

25 
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R(0,2) = 



1 0 0 

0 1 0 

1 0 1 



(EQ-41) 



Moreover, the invertible matrix of R(o,2) will be itself, i.e., 



R(0,2)*R(0,2) = 



1 0 0 
0 1 0 
0 0 1 



= I, and 



(EQ-42) 



thus 



R (o',2) = R (0,2) 



(EQ-43) 



Searching algorithm for S and U matrices 



Step 1: Let S(0) = I, S-i(O) = I, U(0) = T and k = 0. 
Step 2: If there exists a row operation matrix R ( i,j) so that both 
W(R(i,j)U(k)R(i,j))< ¥(U(k)) and W (R(ij)S(k)) ^ W(T), then 

goto Step 3, else goto Step 4. 
Step 3: S(k+1) = R(i,j)S(k), S-i(k+l) = S-i(k)R(i,j), 

U(k+1) = R(i,j)U(k)R(i,j), and k = k+1, 

then goto Step 2. 
Step 4: Let S = S(k), S-i = S-i(k), and U = U(k), 

then STS- 1 = U, W (U) < W (T) , and W (S) ^ W (T) . 
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Doubleword-wise CRC 32 

After searching by running the program to implement 
5 the above steps, there are total 242 solutions of S and U matrices 
satisfied the criteria in the case of doubleword-wise CRC32, 
among them an excellent solution is shown in Figs. 9 and 10. 

Other than the case of the message inputted in 
10 byte- wise form, however, the MAC frame is based on octet format, 
and the length of the processed message is not always divisible by 
4. As a result, some dummies are padded on the message in 
order to have the doubleword-wise format when the 
doubleword-wise CRC calculation is employed. Two strategies 
15 for dummy padding are further proposed: 

Strategy 1 : padding with some zero-valued octets before the prefix 
of the processed message for the transmission side; 
and 

20 Strategy 2: padding with some zero- valued octets after the suffix of 

the processed message for the reception side. 

When the message is inputted in doubleword-wise form 
at a time, the input message and calculated remainder vectors are 

25 



24 



M 32 (k)=[mf m» ... mj]^, and 



R(k)-[r» r» ... r°] 



,0 i r 

I 1x32 



(EQ-44a) 
(EQ-44b) 



Similar to the byte-wise case, the recursive equation R(k) for the 
Scheme 1 and 2 are 



R(k+1) = T(R(k)+M 32 (k)), and 
R(k+1) = T R(k)+M 32 (k). 



(EQ-45a) 
(EQ-45b) 
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In the doubleword-wise case, no matter what the length of a frame 
is, they can be classified in accordance with their length into four 
types: 4n, 4n+l, 4n+2 and 4n+3. If the Strategy 1 is adopted, the 
initial values of the flip-flops will vary with the length type as listed 
in 

Table 1 



Length type 


Padding number 


C(0) 


C(l) 


4n+3 


1 


00 ff ff ff 


ff 00 00 00 


4n+2 


2 


00 00 ff ff 


ff ff 00 00 


4n+l 


3 


00 00 00 ff 


ff ff ff 00 


4n 


4 


00 00 00 00 


f f f f f f f f 



The recursive equations are 
R(l) =TC(0), and 



(EQ-46a) 



25 



R(2) = T(R(1)+C(1)) = T2C(0)+TC(1) 

Let the initial values of the flip-flops be R(0), so that 

5 T2R(0) = T2C(0)+ TC(1), or 

R(0) = C(0)+T-iC(l), 

and R(0) is listed in 

10 Table 2 



Length type 


The initial value of R(0) 


4n+3 


0x9bfla90f 


4n+2 


0x09b93859 


4n+l 


0x816474c5 


4n 


0x46af6449 



In the strategy 2, the resultant output of the flip-flops 
will vary with the length of the processed frame, which result 
implies, for examining the integrity of a received frame, the output 
15 Pi is compared with a specified pattern depending on the length 
type i, in the following rule 

Pi = G 8i[0xc704dd7b]T, for i = 1, 2, 3 and 4, (EQ-48) 
20 and the pattern in 
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(EQ-46b) 



(EQ-47a) 
(EQ-47b) 



Table 3 



Length type 


Padding number 


Pattern (Pi) 


4n+3 


1 


0x4710bb9c . 


4n+2 


2 


0x3a7abc72 


4n+l 


3 


0x8104c946 


4n 


4 j 


0x69044bb59 



If the method 1 is adopted, from Table 2 and the relationship of 

R (k) = SR(k), it is obtained the initial value of the flip-flops in this 
transform-type CRC calculation as 



Table 4 



Length type 


Initial value of R(0) 


4n+3 


0x9bflal0b 


4n+2 


0x09b9385d 


4n+l 


0x816474c5 


4n 


0x46af744d 



Likewise, if the method 2 is adopted, the checking pattern, based 
on the transformation and Table 3, will be 



Length type 



Table 5 
Padding member 



Pattern 
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4n+3 


1 


0x4710a398 


4n+2 


2 


0x3a7abc72 


4n+l 


3 


0x8104dl46 


4n 


4 


0x6904b35d 



The comparison of normal and transform-type CRC 
calculations is summarized in 



Table 6 





W(T)/W(U) 


W(S) 


2XOR 
needed in 
T/U 


2XOR 
needed in 
S 


2XOR 
needed in 
S 1 


Normal 


17 


NA 


420 


NA 


NA 


Transform 


15 


3 


394 


4 


4 



Byte-wise CRC32 

After searching by running the program, there are more 
than 10 million solutions of S and U matrices satisfied the criteria 
in the case of byte-wise CRC32, among them two excellent 
solutions are provided, one for the transmission side and the other 
for the reception side. For the excellent solution of the 
transmission CRC generator, Figs. 11 and 12 show the matrices 
Utx-08 and Stx-os, and Fig. 13 depicts the corresponding circuitry. 
For the excellent solution of the reception CRC checker, Figs. 14 
and 15 show the matrices Urx-os and S rx -08, and Fig. 16 depicts the 
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corresponding circuitry. 

From equation EQ-40, the initial value of the flip-flops 
can be calculated out. Let 

r (4) = (U 4 +U3+tP+U) M (0). (EQ-49) 

Assume 

tf(4) = U^(0), (EQ-50) 

then 

R{0) = (I+U-i+U-2+U-3)M(0). (EQ-51) 

The comparison of normal and transform- type CRC calculations is 
summarized in 



Table 7 





W(T)/W(U) 


W(S) 


2XOR 
needed in 
T/U 


2XOR 
needed in 
S 


2XOR 
needed in 


Normal 


7 


NA 


106 


NA 


NA 


Tx 

Transform 


5 


3 


80 


0 


11 


Rx 

Transform 


5 


4 


78 


0 


0 
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While the present invention has been described in 
conjunction with preferred embodiments thereof, it is evident that 
many alternatives, modifications and variations will be apparent 
to those skilled in the art. Accordingly, it is intended to embrace 
all such alternatives, modifications and variations that fall within 
the spirit and scope thereof as set forth in the appended claims. 
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